Morphological Analysis

نویسنده

  • Martin Kay
چکیده

A computer program that is intended to carry out nontrivial operations on texts in an ordinary language must start by recognizing the words that the text is made up of. This is the procedure I call morphological analysis. It is necessary because the linguistically interesting properties of words cannot be discovered by examining the words themselves but are associated with them in an essentially arbitrary manner. Therefore, there must be a list what we call a dictionary to define the mapping of words into linguistically interesting properties and a process to look words up in this dictionary. Many computer programs have been written in which morphological analysis consists of nothing more than accepting any unbroken string of letters encountered in a text as a word and referring it to a dictionary. This means that, in addition to what is usually found there, the dictionary must contain plural forms of norms, all the forms of every verb, regular or irregular, all adverbs, and so forth. A machine dictionary of English constructed on these principles would contain four to six times as many entries as a standard dictionary but some of these entries could presumably consist of little more than a reference to the standard form of the word the singular of the noun, the infinitive of the verb, or whatever. A modern computer could easily accommodate a dictionary of English enlarged in this way and it is an attractive thing to do if only because it reduces the problem of morphological analysis almost to triviality. The increase in the size of the dictionary is more alarming in the case of a highly inflected language. There are, however, many languages for which this solution is unthinkable and many for which it is clearly undesirable. In ancient Greek, Latin, and Sanskrit, for example, it was not customary to leave spaces between words so that

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Fault Diagnosis Method for Automaton based on Morphological Component Analysis and Ensemble Empirical Mode Decomposition

In the fault diagnosis of automaton, the vibration signal presents non-stationary and non-periodic, which make it difficult to extract the fault features. To solve this problem, an automaton fault diagnosis method based on morphological component analysis (MCA) and ensemble empirical mode decomposition (EEMD) was proposed. Based on the advantages of the morphological component analysis method i...

متن کامل

A Fault Diagnosis Method for Automaton Based on Morphological Component Analysis and Ensemble Empirical Mode Decomposition

In the fault diagnosis of automaton, the vibration signal presents non-stationary and non-periodic, which make it difficult to extract the fault features. To solve this problem, an automaton fault diagnosis method based on morphological component analysis (MCA) and ensemble empirical mode decomposition (EEMD) was proposed. Based on the advantages of the morphological component analysis method i...

متن کامل

Morphological phylogenetic analysis of the genera Fragaria and Duchesnea in Iran

In this research phylogenetic relationships of the two genera Fragaria and Duchesnea, including four species ( Fragaria viridis, F. vesca, Duchesnea indica and D. chrysantha) and 2 of their closely related species (Potentilla reptans and P. micrantha) plus 2 Fillipendulla species ( representing outgroups) were carried out using morphological traits. Primarily, morphological evidences of 30 taxa...

متن کامل

Genetic Diversity Analysis of Maize Hybrids Through Morphological Traits and Simple Sequence Repeat Markers

Comparing different methods of estimating the genetic diversity could define their usefulness in plant breeding programs. In this study, a total of 18 morphological traits and 20 simple sequence repeat (SSR) loci were used to study the morphological and genetic diversity among 20 maize hybrids selected from different countries, and to classify the hybrids into groups based on molecular profiles...

متن کامل

A Taxonomic Reassessment of Consolida (Ranunculaceae) Species: Insight from Morphological and Molecular Data

In order to compare the efficiency of morphological traits and molecular markers in distinguishing the Consolida species, molecular analysis using nrDNA ITS and cpDNA trnL-trnF with maximum parsimony and Bayesian methods were done in a total of 34 species and forma representing 28 species of Consolida, 6 species of Aconitella, plus two species of Delphinium and two species of Aconitum as out gr...

متن کامل

Measurement of Morphological Characteristics of Raw Cane Sugar Crystals Using Digital Image Analysis

Raw cane sugar is one of the most important product in the sugar industry and is the main raw material for the white sugar production. Morphological and physical properties of this product might influence the final white sugar. For instance, the behavior during centrifugation, transport and storage is related to the characteristics of these crystals. The object of this study was to determine th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1973